45 results found.
Speech
Corpus,
Language Type:
Multilingual
Languages:
Dari Farsi Levantine Arabic Pashto Urdu
Availability:
LDC
License:
LDC
Size:
284 GByte Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Attention based Hybrid I-vector BLSTM Model for Language Recognition
-
Paper track:4.1 Language identification and verification, lang/Poster Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anand Mohan | RATS | /N |
Documentation:
Yes, English, Yes
Written
Corpus,
Language Type:
Multilingual
Languages:
Bengali Hindi Kannada Sanskrit Telugu Urdu
Availability:
Freely Available
License:
Size:
None MByte Production Status:
Existing-used
Use:
-
Paper title:Analysing cross-lingual transfer in lemmatisation for Indian languages
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kumar Saunack | SIGMORPHON 2019 Shared Task 1 dataset | /N |
Documentation:
NoneLanguage Type:
Multilingual
Languages:
Urdu
Availability:
Freely Available
License:
Creative Commons
Size:
95.4M words Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:Urdu Word Embeddings
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Samar Haider | University of Engineering and Technology, Lahore | PK |
| Main Contact | Samar Haider | University of Engineering and Technology, Lahore | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Urdu
Availability:
From Owner
License:
Creative Commons
Size:
100000 words Production Status:
Newly created-in progress
Use:
Machine Learning
-
Paper title:Urdu Word Embeddings
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Samar Haider | University of Engineering and Technology, Lahore | PK |
| Main Contact | Samar Haider | University of Engineering and Technology, Lahore | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Urdu
Availability:
Freely Available
License:
CC-BY-SA-NC
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:A Tagged Corpus and a Tagger for Urdu
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Bushra Jawaid | Institute of Formal and Applied Linguistic, Charles University in Prague | NL |
| Author 2 | Amir Kamran | Institute of Formal and Applied Linguistic, Charles University in Prague | CZ |
| Author 3 | Ondřej Bojar | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Main Contact | Bushra Jawaid | University of Amsterdam, ILLC | None |
Documentation:
Documentation will be made available in English with the release of tagger.Language Type:
Trilingual
Languages:
English Hindi Urdu
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
<Not Specified>
-
Paper title:Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ryan Georgi | University of Washington | None |
| Author 2 | Fei Xia | University of Washington | None |
| Author 3 | William Lewis | Microsoft Research | None |
| Main Contact | Ryan Georgi | University of Washington | US |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Urdu
Availability:
Freely Available
License:
CC-BY-SA-NC
Size:
95.4M OtherProduction Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:A Tagged Corpus and a Tagger for Urdu
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Bushra Jawaid | Institute of Formal and Applied Linguistic, Charles University in Prague | NL |
| Author 2 | Amir Kamran | Institute of Formal and Applied Linguistic, Charles University in Prague | CZ |
| Author 3 | Ondřej Bojar | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Main Contact | Bushra Jawaid | University of Amsterdam, ILLC | None |
Documentation:
Documentation will be made available in English with the release of data.Language Type:
Trilingual
Languages:
American English Mandarin Chinese Urdu
Availability:
From Owner
License:
<Not Specified>
Size:
2.3 <Not Specified>Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ting Liu | <Not Specified> | None |
| Author 2 | Samira Shaikh | <Not Specified> | None |
| Author 3 | Tomek Strzalkowski | <Not Specified> | None |
| Author 4 | Aaron Broadwell | <Not Specified> | None |
| Author 5 | Jennifer Stromer-Galley | <Not Specified> | None |
| Author 6 | Sarah Taylor | <Not Specified> | None |
| Author 7 | Umit Boz | <Not Specified> | None |
| Author 8 | Xiaoai Ren | <Not Specified> | None |
| Author 9 | Jingsi Wu | <Not Specified> | None |
| Main Contact | Ting Liu | ILS, University at Albany | US |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Urdu
Availability:
Freely Available
License:
GNU
Size:
9.84 MByte Production Status:
Newly created-finished
Use:
Summarisation
-
Paper title:Urdu Summary Corpus
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Muhammad Humayoun | Faculty of Information Technology, University of Central Punjab, Lahore, Pakistan | PK |
| Author 2 | Rao Muhammad Adeel Nawab | Department of Computer Science, COMSATS Institute of Information Technology, Lahore, Pakistan | PK |
| Author 3 | Muhammad Uzair | Alumni student, Department of Computer Science, COMSATS Institute of Information Technology, Lahore, Pakistan | PK |
| Author 4 | Saba Aslam | Alumni student, Department of Computer Science, COMSATS Institute of Information Technology, Lahore, Pakistan | PK |
| Author 5 | Omer Farzand | Alumni student, Department of Computer Science, COMSATS Institute of Information Technology, Lahore, Pakistan | PK |
| Main Contact | Muhammad Humayoun | Faculty of Information Technology, University of Central Punjab, Lahore, Pakistan | None |
Documentation:
Yes, English, YesLanguage Type:
Multilingual
Languages:
Urdu
Availability:
<Not Specified>
License:
Open Source
Size:
199546 Production Status:
Existing-used
Use:
Multiple NLP tasks
-
Paper title:Improvised and Adaptable Statistical Morph Analyzer (SMA++)
-
Paper track:Short Paper
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Saikrishna Srirampur | IIIT Hyderabad | IN |
| Author 2 | Deepak Kumar Malladi | IIIT Hyderabad | IN |
| Author 3 | Radhika Mamidi | IIIT-Hyderabad, Professor | None |
| Main Contact | Saikrishna Srirampur | IIIT Hyderabad | None |
Documentation:
dl.acm.org/citation.cfm?id=2392773




